Inferring Shared Interests from Tweets
نویسنده
چکیده
Social networks have intruded into the every day’s life. Users contribute with uploading content and sharing it with friends and public. From this information knowledge on who know whom, who communicates with whom, who meets whom and who shares the shame interests with whom can be extracted. These four aspects of human behavior can provide valuable insights while designing networking algorithms and protocols for opportunistic networks. Having efficient algorithms for exchanging information can help in the decentralization of the current centralized approached used to store the information. The distribution of the information could be quite useful in cases where the infrastructure is absent and thus fails to provide services. So far research on the first three topics and the relations between them has be done. In this thesis an effort on the introduction of how sharing the same interests correlates with the first three topics is done. In order to be able to do such a correlation, a measure on assigning similarity scores between users with respect to their shared interests needs to be created. In this thesis the problem of text comparison is investigated. A frequency of words based approach is chosen for the comparison of the texts posted on the social network called "Twitter". Five metrics are created based on special characteristic of twitter texts. The validation of the metrics is done in a set of 250 users. The results show that it is feasible to assign similarity scores between twitter users and thus provide a tool for measurements with the ultimate goal to deepen the understanding of how a pairs of users sharing same interests correlates with knowing each other, meeting each other or communicating with each other.
منابع مشابه
Tweet Ranking Based on Heterogeneous Networks
Ranking tweets is a fundamental task to make it easier to distill the vast amounts of information shared by users. In this paper, we explore the novel idea of ranking tweets on a topic using heterogeneous networks. We construct heterogeneous networks by harnessing cross-genre linkages between tweets and semantically-related web documents from formal genres, and inferring implicit links between ...
متن کاملA Location Inferring Model Based on Tweets and Bilateral Follow Friends
Inferring user’s location has emerged to be a critical and interesting issue in social media field. It is a challenging problem due to the sparse geo-enabled features in social media, for example, only less than 1% of tweets are geo-tagged. This paper proposes a location inferring model for microblog users who have not geo-tagged based on their tweets content and bilateral follow friends. An ap...
متن کاملMining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets
In this paper, we propose to study the problem of identifying and classifying tweets into intent categories. For example, a tweet “I wanna buy a new car” indicates the user’s intent for buying a car. Identifying such intent tweets will have great commercial value among others. In particular, it is important that we can distinguish different types of intent tweets. We propose to classify intent ...
متن کاملInferring Shared Tastes with Network Topic Models
It is widely assumed that people tend to gather in groups of shared interests, where such interests drive friendships and vice versa. Thanks to online social networking platforms, information about a users’s friends as well the items he is interested in are available, but represent only an incomplete picture. We study probabilistic network topic models that distill common shared interests of fr...
متن کاملInferring Shared Interests from Social Networks
Many online community platforms such as LibraryThing, last.fm, flickr, and CiteULike store data about users, friend relationships between users, and for each user a list of items he interacted with. Depending on the usage scenario, items may be books, songs, pictures, or scientific publications, respectively. In social network analysis it is widely assumed that people tend to gather in groups o...
متن کامل